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Abstract 



We present a new family of estimators of the Weibull tail-coefficient. The Weibull 
tail-coefficient is defined as the regular variation coefficient of the inverse failure rate 
function. Our estimators are based on a linear combination of log-spacings of the upper 
^-H ' order statistics. Their asymptotic normality is established and illustrated for two par- 

' ticular cases of estimators in this family. Their finite sample performances are presented 

, on a simulation study. 

» 
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^ . 1 Introduction 

> 

, Weibull tail-distributions encompass a variety of light tailed distributions, i.e. distributions 

' in the Gumbel maximum domain of attraction, see [12j for further details. Weibull tail- 

X/-^ ', distributions include for instance Weibulls, Gaussians, gammas and logistic. The purpose of 

this paper is to study the estimation of a tail parameter associated with these distributions. 
, More precisely, a cumulative distribution function F has a Weibull tail if its logarithmic tail 

satisfies the following property: There exists 9 > such that for all A > 0, 



hm M^^Ii^ = (1) 

t^oo log(l - F{t)) ^ ' 



?-H ■ The parameter of interest Q is called the Weibull tail-coefficient. Such distributions are 



of great use to model large claims in non-life insurance [5]. In the particular case where 
log(l - F(At))/log(l - F{t)) = A^/^ for aU t > and A > 0, estimating B reduces to esti- 
mating the shape parameter of a Weibull distribution. In this context, simple and efficient 
methods exist, see for instance [2], Chapter 4 for a review on this topic. Otherwise, dedi- 
cated estimation methods have been proposed since the relevant information on the Weibull 
tail-coefficient is only contained in the extreme upper part of the sample. A first direction 
was investigated in [7] where an estimator based on the record values is proposed. Another 
family of approaches [H IH [6j [16] consists of using the fc„ upper order statistics where (fc„) is 
an intermediate sequence of integers i. e. such that 

lim fc„ = oo and lim k„/n = 0. (2) 

Note that, since 9 is defined only by an asymptotic behavior of the tail, the estimator should 
use the only extreme- values of the sample and thus the second part of ^ is required. The 
estimators considered here belong to this approach. Let (Xi)i<i<„ be a sequence of indepen- 
dent and identically distributed random variables with cumulative distribution function F. 
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Denoting by Xi^n < ■ ■ • < ^n,n the corresponding order statistics, our family of estimators is 

) - log(X„_fe„+i,„)) / ^ ai,„(loglog(n/i) - loglog(n/A:„)) 

i=l / 1=1 

(3) 

with weights ai,n = W{i/kn)+ei^n defined from W a smooth score function and (ei,n)i<i<fe„-i 
a non-random sequence. We refer to [10[|23j for similar works in the context of the estimation 
of the extreme-value index. 

In Section [2] we state the asymptotic normality of these estimators. In Section [3l we 
provide two examples of weights. The first one leads to the estimator of 9 proposed by 
Beirlant et al. [S]. The second one gives rise to a new estimator for WcibuU tail-distributions. 
The behavior of these two estimators is investigated on finite sample situations in . Finally, 
proofs are given in Section |4l 



2 Asymptotic normality 

Consider the failure rate H = — log(l — F). Writing its generalized inverse H^{t) = 
inf{x, H{x) > t}, assumption ([I} is equivalent to: 

(A.l) H^{t) =t^e{t), 

where £ is a slowly varying function i.e. such that £{Xt)/£{t) — > 1 as t cxo for all A > 0. 
The inverse failure rate function is said to be regularly varying at infinity with index 9 
and this property is denoted by e TZg. We refer to [5] for more information on regular 
variation theory. As a comparison, Pareto type distributions satisfy (1/(1 — i^))^ G TZj, 
and 7 > is the so-called extreme-value index. As often in extreme-value theory, (A.l) is 
not sufficient to prove a central limit theorem for 0„(a). It needs to be strengthened with a 
second order condition on ^, namely that there exist p < and a function b with limit at 
infinity such that 

(A.2) \og{£{\t)/e{t)) - b{t)J^uP-^du, 

uniformly locally on A > 1 and as i oo. The second order parameter p < tunes the rate of 
convergence of i{Xt)/£{t) to 1. The closer p is to 0, the slower is the convergence. Condition 
(A.2) is the cornerstone in all proofs of asymptotic normality for extreme- value estimators. 
It is used in [151 HZl 12] to prove the asymptotic normality of estimators of the extreme- 
value index 7. Table [T] shows that many distributions satisfy (A.l) and (A.2). Among 
them, Extended Weibull distributions, introduced in [TO], encompass gamma, Gaussian and 
Benktander II distributions. We refer to [TT1, Table 3.4.4, for the derivation of b{x) and p in 
each case. Other examples are the Weibull, logistic and extreme- value (with shape parameter 
7 = 0) distributions. 

Throughout the paper, we write Id for the identity function. In particular, if / is a 
function and p a real number, the inequality / < Id'' means f{t) < t'P for any p where it is 
defined. For general L-estimators, conditions on the weights are required to obtain a central 
limit theorem (see for instance [H]). Our assumptions are the following: 

(A. 3) W is defined and continuously differentiable on the open unit interval, 

(A.4) There exist M > 0, < g < 1/2 and p < I such that \W\ < Mld^'^ and \W'\ < 
MId~^~'^ on the open unit interval. 

Similar conditions have been introduced in the context of the estimation of the extreme- value 
index |101 123] . To write the limiting variance of 9n{a), we introduce two quantities: 

H{W) = / W{x)\og{l/x)dx, 
Jo 

o\W) = f f'w{x)W{y r''^'^y^-''y dxdy. 
Jo Jo xy 
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We also define ||e||n,oo — max |ei,n|- We are now in position to state our main result. 

i—l,...,kn—l ' 

Its proof is postponed to Section 2] 

Theorem 1 Suppose (A.1)-(A.4) hold. If (kn) is any intermediate sequence such that 

fcy25(log(n)) A and fc^^ max{l/ log(n), ||e||„,oo} ^ 0, (4) 

then 

kl'^iUa) -9) A AA(A, e'a'{W)/fi\W)). 

Clearly, the bias of the estimator is driven by the function b. This bias term asymptotically 
vanishes if A = 0. Some applications of this result are given in the next section, Corollary [T] 
and Corollary[5] The importance of the bias term is also illustrated on finite sample situations. 
Finally, note that condition ^ implies fc„/n ^> 0. 

3 Comparison of two estimators 

First, we show in Paragraph l3.1[ that our family of estimators ([3]) encompasses the Hill type 
estimator 9^ proposed in [6]. Moreover, it will appear in Corollary [T] that the asymptotic 
normality of 6^ stated in [161, Theorem 2 is a consequence of our main result Theorem 1. 
Second, in Paragraph l3.2[ we use our framework to exhibit a new estimator of the Weibull tail- 
coefficient and to establish its asymptotic normality in Corollary (3] In the third paragraph, 
we show that the new estimator performs as well as the Hill one. 

3.1 Hill type estimator 

Beirlant et al. propose the following estimator of the Weibull tail-coefficient: 

= E (log(^«-.+i,«) - log(X„_fe„+i,„)) / (log log {n/i) - log log (n/fc„)) ■ (5) 

i=l I 1=1 

Clearly, 9^^ is a particular case of 9n{a) with W{x) = 1 for all x G [0,1] and Si^n = for 
all i = 1, . . . , fc„. The asymptotic normality of 9^ , established in Theorem 2 of [T^, can be 
obtained as a consequence of Theorem [T] 

Corollary 1 Suppose (A.l) and (A. 2) hold. If (fc„) is an intermediate sequence such that 
kll"^ max{6(log(n)), 1/ log(n)} ^ 0, then k)l'^(B^ - 6) A M{0, 6^). 



3.2 Zipf estimator 

We propose a new estimator of the Weibull tail-coefficient based on a quantile plot adapted to 
our situation. It consists of drawing the pairs (log log (n/i) , log (X„_i+i^„)) for i — l,...,n— 1. 
The resulting graph should be approximatively linear (with slope 6), at least for the large 
values of i. Thus, we introduce 9^ the least square estimator of 9 based on the fc„ largest 
observations: 

fe„-i /fe„-i 

^n=Yl - l0g(^«-.+ l,«) / J2 (log log (^/*) - log log (^/*) ' 

1=1 / i=l 

where 

Cn = ^ _ ^ log log W*) • 

This estimator is similar to the Zipf estimator for the extreme- value index proposed by Kratz 
and Resnick and Schultze and Steinebach [52] . We prove in Section 2] that 9^ belongs to 
family ([3]) and thus apply Theorem [1] to obtain its asymptotic normality: 
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Corollary 2 Suppose (A.l) and (A. 2) hold. If (kn) is an intermediate sequence such that 
fcyVax{&(log(n)),log2(fc„)/log(n)} 0, then k]J^{ef^ - 9) A M [0,29^). 



4 Proofs 

Throughout this section, we assume that (fc„) is an intermediate sequence and, for the sake 
of simplicity, we note k for fc„. Let us also introduce Kp{X) = J^u''~^du for A > 1 and 
J{x) — W{1 — x) for X S (0,1). The following notations will prove useful: En-k+\,n is 
the (n — fc + l)th order statistics associated to n independent standard exponential variables 
and (-F'i,fc-i)i<i<A;-i are order statistics, independent from En-k+i,n, generated by fc — 1 
independent standard exponential variables. The next lemma presents an expansion of 9n{a). 



Lemma 1 Under (A.l) and (A. 2), 9n{a) has the same distribution as 

9Ti''"^ + (1 + OpilMEr.^k+l.nH^'"^ 
n 

where we have defined 

T^^^ = Y^ai,„(loglog(n/i) -loglog(n/fc)) and 

2 = 1 

Proof : Using the quantile transform, the order statistics {Xi^n)i<i<n have the same distri- 
bution as {H^ {Ei^n))i<i<n- Thus, (A. 2) yields that the numerator of 9n{a) in ([3]) has the 
same distribution as 

fe — 1 ^ T? \ 1 ^z3 / F 



^Vai.nlog ( f " ] + (1 + op(l))5(£;„_fc+i,„)--^ Va^,„i^, 



The Renyi representation asserts that (En-i+i.nl ^n-k+\.n)\<i<k-\ has the same distribution 
as (1 ^ Fk-i^k-\l En~k+\,n)\<i<k-\, sec jlj, p. 72. Therefore, the numerator of 6'„(q;) has the 
same distribution as 

k-\ / p 



k—\ '' \ En-k+l. 

(l + op(l))6(£;„_fe+i,„)^— Y^a,;^„xJ 1 , ^ 



k—i,k — l 



z=l ^ 

Changing z to A: — i in the above formula and remarking that Kg is the logarithm function 
conclude the proof. ■ 

The following lemma provides an expansion of 

fe-i 



k-^ 

1=1 



—J ~ log log {n/k j), 



which frequently appears in the proofs. 
Lemma 2 The following expansion holds: 



log{n/k) { V / Vlog('^/fc) 
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Proof : We write r„ as the sum 

log(n/fc) /fc - 1 ^ k-l^^ \ ^ \ + \og{n/k) ) \og{n/k) ] 

Since 



■1^ 5Z log(*/fc) = log 



Stirling's formula shows that the first term is 



1 f,^o('^^^'^ 



log(n/fc) V \ 2k 

The inequality —x^jl < log(l + a;) — x < 0, valid for nonnegative x shows that the second 
term is of order at most 



log\n/k)k-l^ \log\n/k) 



i=l 

since the above Riemann sum converges to 2 as fc cxd. The result follows. ■ 

The next lemmas are dedicated to the study of the different terms appearing in Lemma [TJ 
First, we focus on the non-random term Tn^K 

Lemma 3 Under (A.1)-(A.4), the following expansion hold: 



7^(1) ^ 



{l + 0(log(fc)fc-^)+o(^^)+0(||e||_) 



log(n/fc) 

Proof : Clearly, T^^^ can be rewritten as the sum 

1 , / , log(fcA)\ , 1 A , \og{k/z) 



log{n/k) J k — 1 -^—^ \ log{n/k) 

The absolute value of the first term is less than HeHn^ooTn which is O(||£||„^oo/log(n/fc)), by 
Lemma [21 The second term can be expanded as 

'^^^ ^ '^-'^ ' ' \ogikli) \ \og{k/i) 



1 1 r / 

, y"W(i/k)\og(k/i) + y"w(i/k)\\og{l . ^ , ,,,, , , 

log n//c k-1^ \ / > &\ / J I V log^/fc J login k) 



(1.1) 



, m(l,2) 



\og{n/k) 



For X G (0, 1), define H{x) = Ty(a;) log(l/a;). The Riemann sum T^^'^'' can be compared to 
KW) by: 

|T(i^i) - f,{W)\ <^J2 + 1^(^)1^^ + O(VA^)- (7) 

i=l i/k<x<(i+l)/k Jo 

Assumption (A. 4) implies that there exists a positive M' such that \H'\ < A/'Id^'^^ on the 
open unit interval, and thus the first term of ^ is bounded above by 



2fc 



i^Ji/k^ dt + fc ^ I O |fc ^ log(fc)) otherwise. 
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Assumption (A. 4) also yields \H\ < Mid *log(l/a:) on the open unit interval and thus the 
second term in ([7]) is O (fc"*^^ log(A:)). It follows that 

Ti'''^ =^l{W)+0{k''-'\og{k)). (8) 

Besides, the well-known inequality | log(l + x) — x| < valid for all nonnegative x together 

with (A. 4) show that |r„ ' '| is bounded by 

\T,['''^\ < „ f ,,,.\ j2i^/kr'^og'im = O ( ) , (9) 

2 log {n/k) ^ 1 ^ V log [n/k) J 

since the above Riemann sum converges to a finite integral. Collecting and (O gives the 
result. ■ 

Second, we focus on the random term T^i^'^'K 

Lemma 4 Let ^ be standard Gaussian random variable. Under (A.1)-(A.4), the following 
expansion hold for all non-positive p: 

En-k+l,n [ KW) \\og {n/k) J 

Proof : Note that xi^'''^ can be written as the sum 



k 



(10) 



Since < Kp{l + x) < x for all nonnegative x, the absolute value of the first term is bounded 
by 

Mn,^T^ E (l + < \\e\\n,ooT:^ E 1?^^' 

which has the same distribution as 

II II 1 fc-i 

e 



E - ir^ — Op (Ikll-oo) ' 

k+l,n K — I ^ tjn-k+l.n 



En-k+l _ 

from the law of large numbers. The second term of pH)) can be expanded as 

1 k — A 

t t ■^-^ N „ i N I ^, / . ^i,k-l \ ^i,k-l 



-E„_fc+i^„ fc — 1 ' fc — 1 1^ \ En-k+l,nJ En^k+1, 

7.(2,P,1) 



En-k+l,' 



Now, (A. 3) and (A. 4) imply that the L-statistics T^''''^^ satisfies the conditions of TP and 
thus is asymptotically Gaussian. More precisely, we have 

T,(2.P^i) £ ^{W) + a(Ty)fc"i/2^(l + op(l)). (11) 

The upper bound on T,!^''''^'' is obtained by remarking that \Kp{l + x) — x\ < (1 — p)x^ jl for 
all nonnegative x. It follows that Tn'^'"^^ is bounded above by 

\ Ai-fe+l,ri/ ^n-fc+l,n ^-C'„_i._Li „ K — i 



z— 1 



n — /c+l,n j— ]^ 
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Now, En-k+i,n is equivalent to log(n/fc) in probability and 



— Y^\J[^/k)\Fl^_,^Op{l), 



from the results of [211 on L-statistics. Thus 

and then collecting PT|) and (|12l) . the second term of ([T(I| is 

^ ^i{W)+a{W)k~^/^i{l + op{l)) + 0^ ^ 



and the result follows. ■ 
We are now in position to prove Theorem [1] and Corollary [51 

Proof of Theorem [TJ From Lemma[Tl k-^/^{6n{a) — 0) has the same distribution as 

/ 7^(2,0) \ rp(2,p) 

Now, En^k+i,n is equivalent to log(n//c) in probability which is also equivalent to log(n), 
see Lemma 5.1 in [13j. Since |6| is regularly varying (see [U]), b{En-k+i,n) is equivalent to 
6(log(n)) in probability. As a consequence, fci/2(0"„(a) -61) has the same distribution as 

0^1/2 MiL_ _ 1 + fci/2fo(iog(,)):^(i + op(i)). (13) 

Let us consider the first term of this sum. Lemma |3l Lemma |4] and condition (j4]) entail that, 
for all non-positive p, the ratio Tn'''''/Tn'' has the same distribution as 



Now, Lemma 1 in [16] asserts a central limit theorem for order statistics of an exponential 
sample, and thus 

log(n/fc) , ^ ^ / fc-V^ 
£;„_fc+i,„ Vlog(n)^ 

Consequently, the first term of (fTS]) converges in distribution to M {Q ^ 9'^ a'^ {W) / ji'^ {W)) . We 
also have that the second term of (fT3|) converges to A in probability and the result is proved. 



Proof of Corollary [2l First remark that ^ can be rewritten as 

fc-i /fc-i 
= 51 "«>(log(-^»-»+i,») - log(X„_fc+i,„)) / ^ a,?„(loglog [n/i) - loglog (n/fc)) , 

i=l / i=l 

where 

z 



= log('V^) (loglog {n/i) ~ Cn) 

log(fe/^) 
log(n/fc) 



log(n//c) log 1 + 



= log(A:/*) + O f ^^^t) - log(n/fc)r„, 
V log(?^) / 
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uniformly on i — l,...,k with Lemma [5] Therefore, we have af„ — W{i/k) + with 
W{x) = — (log(x) + 1) and Si^n — 0(log^(fc)/ log(n)) + 0(log(fc)/fc), uniformly on i = 1, . . . , fc. 
Then, it is easy to check that W satisfies conditions (A. 3) and (A. 4) and that Ijl{W) — 1 
and cr2(H^) = 2. ■ 





1 - F{x) 


9 


b{x) 




Weibull 

W(a,A) 


exp(-(a;/A)") 


1/a 







Extended Weibull 
£W{T,fi,^) 


r{x) exp(— /3a;'^) 


1/r 


7 log X 

X 


-1 


Gaussian 




1/2 


1 log X 

4 a: 


-1 


Gamma 
r(/3,a) 




1 


/i \ log X 

(1 -a)^ 


-1 


Benktander II 


a;^-icxp(-fa;^) 


1/r 


(1— r) log a: 
a: 


-1 


Logistic 
C 


2 

1+cxp X 


1 


_log2 

a: 


-1 


Extreme Value 
£VD{^ji) 


1 — exp(— exp(/i — x)) 


1 


_i£ 

a; 


-1 



Table 1: Some Weibull tail-distributions 
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